ASCII \ Unicode

ASCII \ Unicode

Previous Top

American Standard Code for Information Interchange (ASCII)
A standard single-byte character encoding scheme used for text-based data. ASCII uses designated 7-bit or 8-bit number combinations to represent either 128 or 256 possible characters. Standard ASCII uses 7 bits to represent all uppercase and lowercase letters, the numbers 0 through 9, punctuation marks, and special control characters used in U.S. English.
Most current x86-based systems support the use of extended ASCII. Extended ASCII allows the eighth bit of each character to identify an additional 128 special symbol characters, foreign-language letters, and graphic symbols.Those 128 symbols are used to represent Greek, Cyrillic, Hebrew or Arabic characters.

Unicode

A character encoding standard developed by the Unicode Consortium that represents almost all of the written languages of the world. The Unicode character repertoire has multiple representation forms, including UTF-8, UTF-16, and UTF-32. Most Windows interfaces use the UTF-16 form.A 16 bit code allows for about 65,000 different representations. This is enough to encode the popular Asian languages like Chinese, Korean or Japanese and all other languages.